SEQUENCE LISTING 


<110> Stomp, Anne-Marie 
Dickey, Lynn 
Gasdaska, John 


<120> Expression of Biologically Active 
Polypeptides in Duckweed 

<130> 40989/237225 

<150> US 60/293,330 
<151> 2001-05-23 

<150> US 60/221,705 
<151> 2000-07-31 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 554 
<212> DNA 
<213> Zea mays 

<400> 1 

gatcaagtgc aaaggtccgc cttgtttctc ctctgtctct tgatctgact aatcttggtt 60 

tatgattcgt tgagtaattt tggggaaagc ttcgtccaca gttttttttt cgatgaacag 120 

tgccgcagtg gcgctgatct tgtatgctat cctgcaatcg tggtgaactt atgtctttta 180 

tatccttcac taccatgaaa agactagtaa tctttctcga tgtaacatcg tccagcactg 240 

ctattaccgt gtggtccatc cgacagtctg gctgaacaca tcatacgata ttgagcaaag 300 

atctatcttc cctgttcttt aatgaaagac gtcattttca tcagtatgat ctaagaatgt 360 

tgcaacttgc aaggaggcgt ttctttcttt gaatttaact aactcgttga gtggccctgt 420 

ttctcggacg taaggccttt gctgctccac acatgtccat tcgaatttta ccgtgtttag 480 

caagggcgaa aagtttgcat cttgatgatt tagcttgact atgcgattgc tttcctggac 540

ccgtgcagct gcgg 554 

<210> 2 
<211> 498 
<212> DNA
<213> Artificial Sequence

<220>
<223> Duckweed codon optimized nucleotide sequence
      encoding human alpha-2B interferon

<221> CDS
<222> (1)...(498)

<400> 2 

tgc gac ctc ccc cag acc cac agc ctc ggg tcc cgc cgc acc ctc atg 48
Cys Asp Leu Pro Gln Thr His Ser Leu Gly Ser Arg Arg Thr Leu Met
1               5                   10                  15

ctg ctg gcg cag atg cgc cgc atc tcg ctc ttc agc tgc ctg aag gac 96
Leu Leu Ala Gln Met Arg Arg Ile Ser Leu Phe Ser Cys Leu Lys Asp
            20                  25                  30

cgc cac gac ttc ggc ttc ccg cag gag gag ttc ggc aac cag ttc cag 144
Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Gly Asn Gln Phe Gln
        35                  40                  45

aag gcc gag acg atc ccc gtg ctc cac gag atg atc cag cag atc ttc 192
Lys Ala Glu Thr Ile Pro Val Leu His Glu Met Ile Gln Gln Ile Phe
    50                  55                  60

aac ctg ttc agc acc aag gac agc tcg gcc gcc tgg gac gag acc ctg 240
Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr Leu
65                  70                  75                  80

ctc gac aag ttc tac acc gag ctg tac cag cag ctc aac gac ctg gag 288
Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu Glu
                85                  90                  95

gcg tgc gtg atc cag ggg gtt ggg gtt acg gag acg ccg ctg atg aag 336
Ala Cys Val Ile Gln Gly Val Gly Val Thr Glu Thr Pro Leu Met Lys
            100                 105                 110

gag gac agc atc ctc gcc gtg cgc aag tac ttc cag cgc atc acg ctc 384
Glu Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr Leu
        115                 120                 125

tac ctc aag gag aag aag tac agc ccg tgc gcc tgg gag gtc gtt cgc 432
Tyr Leu Lys Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val Arg
    130                 135                 140

gcc gag atc atg cgc tcc ttc agc ctg agc acc aac ctc cag gag agc 480
Ala Glu Ile Met Arg Ser Phe Ser Leu Ser Thr Asn Leu Gln Glu Ser
145                 150                 155                 160

ctc cgc tcc aag gag taa 498
Leu Arg Ser Lys Glu  *
                165


<210> 3 

<211> 96 

<212> DNA 

<213> Oryza sativa 

<400> 3 

accatgcagg tcctgaacac gatggtcaac aagcacttcc tctccctgtc cgtcctcatc 60
gtcctcctcg ggctgagcag caacctcacc gccggc 96

<210> 4 
<211> 188 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Ala Leu Thr Phe Ala Leu Leu Val Ala Leu Leu Val Leu Ser Cys
1               5                   10                  15

Lys Ser Ser Cys Ser Val Gly Cys Asp Leu Pro Gln Thr His Ser Leu
            20                  25                  30

Gly Ser Arg Arg Thr Leu Met Leu Leu Ala Gln Met Arg Arg Ile Ser
        35                  40                  45

Leu Phe Ser Cys Leu Lys Asp Arg His Asp Phe Gly Phe Pro Gln Glu
    50                  55                  60

Glu Phe Gly Asn Gln Phe Gln Lys Ala Glu Thr Ile Pro Val Leu His
65                  70                  75                  80

Glu Met Ile Gln Gln Ile Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser
                85                  90                  95

Ala Ala Trp Asp Glu Thr Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tyr
            100                 105                 110

Gln Gln Leu Asn Asp Leu Glu Ala Cys Val Ile Gln Gly Val Gly Val
        115                 120                 125

Thr Glu Thr Pro Leu Met Lys Glu Asp Ser Ile Leu Ala Val Arg Lys
    130                 135                 140

Tyr Phe Gln Arg Ile Thr Leu Tyr Leu Lys Glu Lys Lys Tyr Ser Pro
145                 150                 155                 160

Cys Ala Trp Glu Val Val Arg Ala Glu Ile Met Arg Ser Phe Ser Leu
                165                 170                 175

Ser Thr Asn Leu Gln Glu Ser Leu Arg Ser Lys Glu
            180                 185


<210> 5 
<211> 165 
<212> PRT 

<213> Homo sapiens 


<400> 5 

Cys Asp Leu Pro Gln Thr His Ser Leu Gly Ser Arg Arg Thr Leu Met
1               5                   10                  15

Leu Leu Ala Gln Met Arg Arg Ile Ser Leu Phe Ser Cys Leu Lys Asp
            20                  25                  30

Arg His Asp Phe Gly Phe Pro Gln Glu Glu Phe Gly Asn Gln Phe Gln
        35                  40                  45

Lys Ala Glu Thr Ile Pro Val Leu His Glu Met Ile Gln Gln Ile Phe
    50                  55                  60

Asn Leu Phe Ser Thr Lys Asp Ser Ser Ala Ala Trp Asp Glu Thr Leu
65                  70                  75                  80

Leu Asp Lys Phe Tyr Thr Glu Leu Tyr Gln Gln Leu Asn Asp Leu Glu
                85                  90                  95

Ala Cys Val Ile Gln Gly Val Gly Val Thr Glu Thr Pro Leu Met Lys
            100                 105                 110

Glu Asp Ser Ile Leu Ala Val Arg Lys Tyr Phe Gln Arg Ile Thr Leu
        115                 120                 125

Tyr Leu Lys Glu Lys Lys Tyr Ser Pro Cys Ala Trp Glu Val Val Arg
    130                 135                 140

Ala Glu Ile Met Arg Ser Phe Ser Leu Ser Thr Asn Leu Gln Glu Ser
145                 150                 155                 160

Leu Arg Ser Lys Glu
                165


<210> 6 
<211> 31 


<212> PRT 

<213> Oryza sativa 


<400> 6 

Met Gln Val Leu Asn Thr Met Val Asn Lys His Phe Leu Ser Leu Ser
1               5                   10                  15

Val Leu Ile Val Leu Leu Gly Leu Ser Ser Asn Leu Thr Ala Gly
            20                  25                  30


<210> 7 
<211> 31 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Modified rice alpha-amylase signal peptide 
<400> 7 

Met Gln Val Leu Asn Thr Met Val Asn Lys His Phe Leu Ser Leu Ser
1               5                   10                  15

Val Leu Ile Val Leu Thr Val Leu Ser Ser Asn Leu Thr Ala Gly
            20                  25                  30


<210> 8 
<211> 21 
<212> PRT 

<213> Arabidopsis thaliana 
<400> 8 

Met Lys Thr Asn Leu Phe Leu Phe Leu Ile Phe Ser Leu Leu Leu Ser
1               5                   10                  15

Leu Ser Ser Ala Glu
            20




